Statistical Generation: Three Methods Compared and Evaluated

نویسنده

  • Anja Belz
چکیده

Statistical NLG has largely meant n-gram modelling which has the considerable advantages of lending robustness to NLG systems, and of making automatic adaptation to new domains from raw corpora possible. On the downside, n-gram models are expensive to use as selection mechanisms and have a built-in bias towards shorter realisations. This paper looks at treebank-training of generators, an alternative method for building statistical models for NLG from raw corpora, and two different ways of using treebank-trained models during generation. Results show that the treebank-trained generators achieve improvements similar to a 2-gram generator over a baseline of random selection. However, the treebank-trained generators achieve this at a much lower cost than the 2-gram generator, and without its strong preference for shorter realisations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Appraisal of the evolutionary-based methodologies in generation of artificial earthquake time histories

Through the last three decades different seismological and engineering approaches for the generation of artificial earthquakes have been proposed. Selection of an appropriate method for the generation of applicable artificial earthquake accelerograms (AEAs) has been a challenging subject in the time history analysis of the structures in the case of the absence of sufficient recorded accelerogra...

متن کامل

Determination of the Colorants in Various Samples by Chemometric Methods Using Statistical Chemistry

partial least square and principal component regression methods were applied to various mixtures of Allura Red and Brilliant Blue to determine the concentrations. Colorants, at the same time, were analyzed with UV-spectrophotometry in chemical separation. The obtained experimental data have been evaluated by chemometric methods as Partial Least Squares (PLS) and Principle Component Regressi...

متن کامل

Online Torque Ripple Reduction in a Three-Phase 12 by 8 Switched Reluctance Motor Using Genetic Algorithm in PWM Generation

Despite a large number of advantages, Torque Ripple (TR) is the most important drawback of Switched Reluctance Motor (SRM). In the presented study, TR was reduced by optimizing the gate pulse angle of the SRM phase which played a leading role in the generated torque profile. For the Optimization, one of the strategies of Genetic Algorithm (GA) which was named Non-dominated Sorting Genetic Algor...

متن کامل

Specification of Hemato-Endothelial-Like Structures and Generation of Hematopoietic Progenitor Cells from Human Pluripotent Stem Cells

 Background and purpose: Human pluripotent stem cells (hPSCs) with the ability to differentiate into adult cells have provided a new perspective for treatment of some diseases. But, the efficiency of differentiation methods to generate hematopoietic progenitor cells (HPCs) is faced with multiple challenges. In the present study, we investigated the formation of hemato-endothelial-like structure...

متن کامل

The future status of solid waste generation in Tehran metropolis with regression analysis method based on population

Background and Objective: Knowledge about the quantity of municipal solid waste (MSW) generation plays a key role in formulating policies of waste management. So far, different methods have been applied to estimate the quantity of waste generation. In this study, eight specific forms of mathematical functions were evaluated to predict waste generation by the regression analysis method based on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005